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Abstract. Characteristic curves of a Hamilton-Jacobi equation can be seen 
as action minimizing trajectories of fluid particles. However this description 
is valid only for smooth solutions. For nonsmooth "viscosity" solutions, which 
give rise to discontinuous velocity fields, this picture holds only up to the 
moment when trajectories hit a shock and cease to minimize the Lagrangian 
action. In this paper we show that for any convex Hamiltonian, a viscous 
regularization allows to construct a nonsmooth flow that extends particle tra- 
jectories and determines dynamics inside the shock manifolds. This flow con- 
^ sists of integral curves of a particular velocity field, which is uniquely defined 

everywhere in the flow domain and is discontinuous on shock manifolds. 

(N 

1. Introduction 

^"j 1.1. The Hamilton Jacobi equation and viscosity solutions. The evolution- 

al ary Hamilton-Jacobi equation 

(1.1) ^ + H(t,x,V<t>)=0, 

appears in diverse mathematical models ranging from analytical mechanics to com- 
binatorics, condensed matter, turbulence, and cosmology (see, e.g., a non-exhaustive 
| set of references in [l]). In many of these applications the objects of interest arc 

described by singularities of solutions, which inevitably appear for generic initial 



X 

5-H 



data after a finite time due to the nonlinearity of ( 1.1 ). Therefore one of the central 
issues both for theory and applications is to understand the behaviour of the system 
after singularities form. 

(N 



A useful example to be borne in mind when thinking about (1.1 1 — and arguably 
the most widely known variant thereof — is the Riemann, or inviscid Burgers, equa- 
tion. In the physics notation (the dot • for inner product and V for spatial gradient) 
this equation has the form 

dii 

(1.2) — +it-Vw = 0, u = V6. 

at 



The first eq. (1.2 ) corresponds to the Hamiltonian H (t, x,p) — \p\ /2. This equation 



may in turn be considered as a limit of vanishing viscosity of the Burgers equation 

(1.3) ~~dT +u * ' VuAl = / iy2wM ' mM = 



so solutions of (1.2 1 can be defined as limits of smooth solutions to (1.3) as the 



positive parameter /i goes to zero. 
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The Burgers equation is in fact very special: it can be exactly mapped by the 
Cole-Hopf transformation into the linear heat equation and therefore explicitly 
integrated, which in turn allows to explicitly study the limit p I 2 . Although 
in the case of general convex Hamiltonian the Hopf-Cole transformation is not 



available, the qualitative behaviour of solutions to a parabolic regularization of ( 1.1 ) 



(1.4) -j_ + H{t ,x,v4>n = ^ 2 ^ 

as viscosity vanishes is similar to that for the Burgers equation. The limit 4>{t, x) = 
lim^o ^(t, x) exists and is called the entropy (or viscosity) solution. 

A theory of weak solutions for a general Hamilton- Jacobi equation, employ- 
ing the regularization by infinitesimal viscosity, exists since the 1970s [3||5]. In 
the one-dimensional setting this theory is essentially equivalent to the earlier the- 
ory of hyperbolic conservation laws [2|[6jj8] . The theory of weak solutions for the 
Hamilton-Jacobi equation is closely related to calculus of variations, and introduc- 
tion of diffusion corresponds to stochastic control arguments [jjj. The viewpoint 
of the present paper is somewhat complementary: the Hamilton Jacobi equation 
is considered as a fluid dynamics model, and the main goal is to construct a flow 
of "fluid particles" inside the shocks of a weak solution. However it is convenient 
to start with the Lax-Oleinik variational pronciple, which provides a purely vari- 
ational construction of the viscosity solution. Remarkably this construction does 
not use any explicit viscous regularization. 

1.2. The variational construction of viscosity solutions and shocks. As- 
sume that the Hamiltonian function H(t,x,p) is smooth and strictly convex in the 
momentum variable p, i.e., is such that for all (i, x) the graph of H(t,x,p) as a 
function of p lies above any tangent plane and contains no straight segments. This 
implies that the formula v — V p H(t,x,p) establishes a one-to-one correspondence 
between values of velocity v and momentum p. Moreover, the Lagrangian function 

(1.5) L(t, x, v) — max \p ■ v — H(t, x,p)], 

p 

under the above hypotheses is smooth and strictly convex in v. (Note that L may 
be not finite everywhere: e.g., the relativistic Hamiltonian H(t,x,p) = yT + \p\ 2 
corresponds to the Lagrangian L(t,x,v) that is defined for |t>| < 1 as — yl — M 2 
and takes value +oo elsewhere. This does not happen if in addition one assumes 
that the Hamiltonian H grows superlinearly in \p\.) 

The relation between the Lagrangian and the Hamiltonian is symmetric: they 



are Legendre-Fenchel conjugate (1.51 to one another. This relation can also be 



expressed in the form of the Young inequality: 

(1.6) L(t,x,v) + H(t,x,p) > vp, 

which holds for all v and p and turns into equality whenever v = V ' p H(t,x,p) or 
equivalently p — V v Llt, x,v). The two maps p H> V p i?(i, x,p) and v i— > V v L(t, x, v) 
are thus inverse to each other; we will call them the Legendre transforms at (t, x) of 
p and of v. (Usually the term "Legendre transform" refers to the relation between 
the conjugate functions H and L; here we follow the usage that is adopted by 
A. Fathi in his works on weak KAM theory (10 and is more convenient in the 
present context.) 

Note that if H(t,x,p) = \p\ 2 /2, then L(t,x,v) = \v\ 2 /2 and the Legendre trans- 
form reduces to the identity v = p, blurring the distinction between velocities and 
momenta. This is another special feature of the (inviscid) Burgers equations. 



Now assume that (j>(t,x) is a strong solution of the inviscid equation (1.1), i.e., 
a C 2 function that satisfies the equation in the classical sense. For an arbitrary 
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diffcrentiable trajectory j(t) the full time derivative of 4> along 7 is given by 

d#t,7) dcj, 



(1.7) 



df 



9* 



+ 7. V0 = 7 - V0-ff(i, 7 ,V0) < L(i,7,7), 



where at the last step the Young inequality (1.6) is used. This implies a bound for 



the mechanical action corresponding to the trajectory 7: 



(1.8) 



<t>(h, 7(h)) < 0(*i,7(*i)) + / 7(«). 7(*)) da- 



Equality in (1.7) is only achieved if 7 is the Legendre transform of V0 at every 
point (i,7(*)): 

(1.9) 7(t) = V p £r(*,7,V0(i J 7)). 



Therefore the bound (1.8) is achieved for trajectories satisfying Hamilton's canon- 
ical equations, with momentum given for the trajectory 7 by p 7 (i) := V(f>(t,j(t)). 
(The second canonical equation, p = —V X H, follows from and (1.9) for a C 2 
solution d> because 



(1.10) Vl {t) 



~dt~ 



+7-(V®V0) = -V x if(t,7,V^)-Vpi/"-(V®V^)+r(V®V0), 



where the last two terms cancel.) 

This is a manifestation of the variational principle of the least action: Hamil- 
tonian trajectories (~/(t),Py(t)) are (locally) action minimizing. In particular, if the 
initial condition 



(1.11) 



0(t = O,y) = to(y), 



is a fixed smooth function, the identity 
(1.12) <Kt,x) = 0o(7(O)) 4 



£(s,7(s),7(«))ds 



holds for a Euler-Lagrange trajectory 7 such that j(t) = x and p 7 (0) = V0o(7(O)). 

However the least action principle has wider validity: in fact it can be used to 
construct the viscosity solution corresponding to the initial data ( 1.11[ ): 



(1.13) <Kt,x)= min (0o( 7 (O))+ / L(s, 7(5), 7(5)) ds 

7: ~f(t)=x\ J Q 



This is the celebrated Lax-Oleinik formula (see, e.g., [XT] or [lO]), which reduces 
a PDE problem ( |1.1| , ( |1.11[ ) to the variational problem (1.13) where minimization 
is extended to all sufficiently smooth (in fact absolutely continuous) curves 7 such 
that j(t) = x. 

At those points (t,x) where the function <fi defined by (1.13) is smooth in x, 



the minimizing trajectory is unique. In this case, the minimizer can be embedded 
in a smooth family of minimizing trajectories whose endpoints at time and t 
are continuously distributed about 7(0) and ^(t) = x (a convenient reference is 
17 Section 6.4], although this fact is classical). A piece of initial data c/>o gets 



continuously deformed according to (1.7) along this bundle of trajectories into a 
piece of smooth solution <fi to (1.1) defined in a neighbourhood of x at time t. Of 
course the Hamilton-Jacobi equation is satisfied by <fi in strong sense at all points 
where it is differentiable. 



But the crucial feature of (1.13) is that generally there will be points (t,x) with 
several minimizers 7* that start at different locations 73 (0) and bring the same value 
of action to x = Ji(t). Just as above, each of these Hamiltonian trajectories will be 
responsible for a separate smooth "piece" of solution. Thus for locations x' close 
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to x the function <p will be represented as a pointwise minimum of these smooth 
pieces fa: 

(1.14) fa[t,x') = mmfa(t,x'). 

i 

As all 7, have the same terminal value of action, all the pieces intersect at (t,x): 
fa(t,x) = fa(t,x) = ■■■ = </>(t, x). Thus the neighbourhood of x at time t is 
partitioned into domains where <fi coincides with each of the smooth functions fa 
and satisfies the Hamilton-Jacobi equation ( |1.1| in the strong sense. These domains 
are separated by surfaces of various dimensions where two, or possibly three or 
more, pieces fa intersect and their pointwise minimum <£> is not differentiable. Such 
surfaces are called shock manifolds or simply shocks. Note that a function <f> defined 
by the Lax-Oleinik formula is continuous everywhere, including the shocks; it is its 
gradient that suffers a discontinuity. 

In general, there are infinitely many continuous functions that match the initial 
condition ( |1.11[ ) and in the complement of the shock surfaces are differentiable and 



satisfy the Hamilton-Jacobi equation (1.1 1, just as 4> does. What distinguishes 



the function 4> defined by the variational construction (1.131 from all these "weak 



solutions" , and grants it with important physical meaning, is that (f> appears in 



the limit of vanishing viscosity for the regularized equation (1.4) with the initial 
condition ([lTT]) (see, e.g., |3]). For a smooth Hamiltonian it can be proved that in 
a viscosity solution minimizers can only merge with shocks but never leave them. 



Now observe that in a solution <j) given by the Lax-Oleinik formula (1.13) a 
minimizer that has come to a shock cannot be continued any longer as a minimizing 
trajectory: wherever it might go, there will be other trajectories originated at t = 
that will bring smaller values of action to the same location. Hence for the purpose 



of the least action description (1.13), Hamiltonian trajectories become irrelevant as 
soon as they hit shocks. The set of trajectories which survive as minimizers until 
time t > is decreasing with t, but at all times it is sufficiently large to cover the 
whole continuum of final positions. 

1.3. The fluid dynamics picture. Let us now adopt an alternative viewpoint and 
consider the Hamilton-Jacobi equation as a fluid dynamics model, assuming that 



Hamiltonian trajectories (1.9) are described by material "particles" transported by 
the velocity field u(t,x), which is the Legendre transform of the momenta field 
p(t,x) — V(f>(t,x). From this new perspective it is no longer natural to accept 
that particles annihilate once they reach a shock. Can therefore something be said 
about the dynamics of those particles that got into the shock, notwithstanding the 
fact that their trajectories cease to minimize the action? The difficulty in such an 
approach is related to the discontinuous nature of the velocity field u, which makes 
it impossible to construct classical solutions to the transport equation j(t) = u(t, 7). 

In dimension d = 1 the answer to the question above is readily available. Shocks 
at each fixed t are isolated points in the x space and as soon as a trajectory merges 
with one of them, it continues to move with the shock at all later times. This de- 



scription is related to C. Dafermos' theory of generalized characteristics 12 which, 
in fact, can be extended to a much more general situation of nonconvex Hamil- 
tonians and systems of conservation laws. However, in several space dimensions 
shock manifolds are extended surfaces of different codimension, and dynamics of 
trajectories inside shocks is by no means trivial. 

Our interest in this problem is related to the earlier work by I. Bogaevsky where 



the problem was solved for the case of the Burgers equation (1.3) 13 ,14 using the 
following approach. Consider the differential equation 

(1.15) ^(t)=u»(t,r), r(0)=y. 
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FIGURE 1. Bogaevsky's construction 13,14 of the effective velocity v 
at a triple shock point in dimension d = 2: (a) the local structure of the 
flow (V, v", and v" are the limiting values of velocity when the triple 
point is approached from three different domains of smooth flow); (b) the 
effective velocity v is the center of the smallest circle containing the three 
limiting velocities; (c) the smallest circle (dashed) is not necessarily the 
circumscribed one (dotted), so the effective velocity may be determined 
by a proper subset of limiting velocities (here, v' and v"). 



Since for /i > is a smooth vector field, this equation defines a family of 
particle trajectories that form a smooth flow. The next step is to take the limit 
of this flow as /i J, 0. ft turns out that this limit exists as a non-diffcrcntiablc 
continuous flow, for which the forward derivative j(t + 0) — lirn T j,o [7 (t + T) — 7(£)]/t 
is defined everywhere. If 7(f) is located outside shocks, this derivative coincides with 
u(t,j(t)). Otherwise the effective velocity j(t + 0) is determined by the extremal 
values of velocities Uj = V^>j, and there is an interesting explicit representation for 
it: j(t + 0) coincides with the center of the smallest ball that contains all itj (fig. [l}. 
The limiting flow turns out to be coalescing (and therefore not time-reversible): 
once any two trajectories intersect, they stay together for all later times. 

Moreover, it turns out that pieces of the shock manifold may be classified into 
restraining and nonrestraining depending on whether trajectories stay on them 
or leave them along pieces of shock manifold of lower codimensiorQ Shocks of 
codimension one are always restraining; in particular, such are all shocks in the 
one-dimensional case. Interestingly, this classification, introduced for the first time 
by Bogaevsky in [13] ("acute" and "obtuse" superdifferentials of 4>) seems to have 



been overlooked by physicists despite its clear physical significance 15 



The proofs of all these facts in 13 14 were based on specific properties of the 



quadratic Hamiltonian and cannot be extended to the general setting of a convex 
Hamiltonian. In this work we propose a different approach to the vanishing viscosity 
limit in the general setting. This approach, which is based on the fundamental 
uniqueness of the possible limiting behaviour of 7^, leads to results on existence, 
uniqueness, and explicit representation of the limit velocities. 



It should be remarked that equation (1.9), which relates the velocity 7 of a 
trajectory to the gradient V(f>(t, 7) of the solution, can be seen as defining a gen- 
eralization of the gradient flow of the function </> [l0jfl4]. Such a flow coincides 
with the conventional gradient flow when H(p) = \pf~/2 and <j> is smooth. The case 
of a concave (or semiconcave) nonsmooth (f> can be handled using the differential 
inequality that goes back to the work of H. Brezis [16] . A similar approach was also 
used by P. Cannarsa and C. Sinestrari in the context of propagation of singularities 



x In the example of fig. [l] case (c) corresponds to a nonrestraining triple point in d = 2, 
which trajectories leave through the shock line that divides domains of smooth flow with limiting 
velocities v' and v"; see fig. 3 from [13] and the discussion therein. 
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for the eikonal equation with quadratic Hamiltonian 17 Lemma 5.6.2]. However 



the classification of singluarities into "restraining" and "nonrcstraining" seems to 



have been unknown before the work 13 even in the quadratic case. 

We conclude with a few observations concerning our approach. 

Seen as a family of continuous maps of variational origin from initial coordinates 
y to current coordinates x, the flow j y is clearly relevant for optimal transportation 



problems 23 24 . An interesting problem suggested by B. Khesin is to study the 
extremal properties of this flow. Indeed it is known from [26] that before the 
first shock formation the flow ~f y is and action minimizing flow of diffcomorphisms, 
while the first shock formation time t* marks a conjugate point in the corresponding 
variational problem. According to the suggested view, the flow constructed above 
may be seen as a kind of saddle-point, rather than minimum, for a suitable transport 
optimization problem. 

Another natural context to place our construction in is that of differential inclu- 
sions (see, e.g., [27|). The flow consructed in this paper may be seen as a solution 
of differential inclusion 

(1.16) T£ V p ff(i )7 ,Prp^(t,7)), 

where Pr p is the p projection of the superdifferential dip. In comparison with stan- 



dard constructions of the theory of differential inclusions the flow ~f y solves (1.16) 
in a stronger sense: the forward derivative jy(t + 0) exists everywhere. 

1.4. Outline of the paper. In Section[2]we develop a local theory for Lagrangian 
particles in a gradient flow defined by a viscosity solution <p. Here we introduce 
the notions of admissible velocity and admissible momentum at a shock, which are 
central to our approach. The admissible velocity at each point turns out to be the 
unique solution to a particular convex minimization problem, which extends the 
construction of the center of the smallest Euclidean ball (cf fig. [I]) to the general 
convex case. 

In Section [3] we show that the limit of a flow regularized with small viscosity is 
tangent to the field of admissible velocities. This establishes an existence theorem 
for integral curves of this field. 

The issue of uniqueness of the limiting flow and a perturbative approach that 
allows to determine the higher time derivatives of limiting trajectories are discussed 
in Section |U 

In the course of this work we benefitted from valuable discussions with Jeremie 
Bee, Patrick Bernard, Ilya Bogaevsky, Yann Brenier, Philippe Choquard, Michael 
Dabkowski, Uriel Frisch, and Boris Khesin. It is a pleasure to recognize their help 
as well as the unique environment of the Observatoire de la Cote d'Azur, where 
this work has started and advanced. 



2. Viscosity solutions and admissible gradient vector fields 

2.1. Superdifferentials of viscosity solutions. Let <j> be a viscosity solution 
to the Hamilton-Jacobi equation ( |1.1[ ) with initial data ( |1.11[ ). We shall use the 
following standard facts, for which we refer the reader again to the recent and very 
useful exposition in [l7j Section 6.4], although many of these facts date from 50 
and more years ago: (i) the function (f> is locally uniformly semiconcave in (t, x) 
variables; (ii) if there is a single minimizer coming to (t, x), then cj> is differentiable 
at this point, C 2 smooth in some its neighbourhood, and 

(2.1) d>{t +r,x + i) = cj>(t, x) + |J r + ■ £ + o(\r\ + |£|) 

(2.2) = cj>{t, x) - H(t, x, V<t>) r + ■ £ + o(|r| + 
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(iii) if (f> is not differentiable at (t, x) and there is a finite number of minimizers ji 
such that 7i(£) = x, then each of them corresponds to a different smooth branch fa 
of solution defined in the neighbourhood of (t,x). Then the Lax-Oleinik formula 
implies that 

(2.3) <j)(t + T,x + £) =min0 l (t + r,.T + O 

i 

(2.4) = 4>(t, x) + mm{-H lT + Pl • f) + o(\t\ + |f |), 

where ft := V4>i(t,x) and i?; := H(t,x,pi). 

In the latter case neither of expressions —HiT + provides a valid linear 

approximation to the difference <j)(t + r, x + f) — (f>(t,x) at all points, but they 
all majorize this difference up to a remainder that is cither linear or higher-order, 
depending on r and £. Evidently, so does the linear form —Hr+p-t; for any convex 
combination 

(2.5) P = ^Aip i; H = Y,hHi 

i i 

with Ai > 0, \j = 1. In convex analysis these convex combinations are called 
supergradients of (j) at (t, a;) and the whole collecton of them, which is a convex 



polytope with vertices (— is called the superdifferential of </> 17 18 . We use 
for the superdifferential the notation d<fi(t,x). 

To avoid a possible misunderstanding it should be noted that, although unique- 
ness of a minimizer coming to (t, x) implies differentiability in a; of a nonsmooth 
solution 4> to the Hamilton- Jacobi equation at t and earlier times, it does not imply 
its differentiability at any t + r > t. Such points (f, x) where the differentiability 
cannot be extended to an open neighbourhood in spacetime are called preshocks 
[l] and correspond to conjugate points of a corresponding variational problem; a 
classification of all the possible combinations of shocks and preshocks in dimensions 
d = 2 and d = 3 is provided in [19 . Note that at a preshock the linearization of 
4>i does not fully anticipate the shocks at times t + t for any r > 0. However the 
differentiability is recovered as r J, 0, because the corresponding superdifferential 
shrinks to the gradient of cf> at (t, x) . 

Under a viscous regularization ^ of the solution <f>, a shock point (t, x) is 
"smeared" over a small area where (difi^/dt, V<^) takes on all values from rela- 
tive interior of d<j){t,x). Thus, intuitively, d(j)(t,x) is a set of values taken by the 
spacetime gradient of the nonsmooth function 4> in an infinitesimal neighbourhood 
of (t,x). 

Completing the gradient field with superdifferentials at points where <f> is not 
smooth recovers, in a weaker sense, the continuity of the map (t, x) t— > d<p{t,x). 
Indeed, suppose (t n ,x n ) converges to (t, x) and the sequence (—H n ,p n ) S d<f>(t„, x n ) 
has a limit point (—H,p). By definition of superdifferential, 

(2.6) (tfjbn + r, x n +£,)- (f>(t n , Xn) < -H n T+p n -£,+ o(\t\ + If I); 

passing here to the limit and using continuity of 0, we see that (—H,p) € d<p(t,x). 
Therefore the superdifferential d<j)(t, x) contains all the limit points of superdiffer- 
entials d(f>(t n ,x n ) as (t n ,x n ) converges to (t, x). 

To make this argument rigorous, some control is needed over the remainder term 



in ( 2.6 1. This is easy for convex or concave functions 18 , for which such inequalities 
hold without remainders. A wider function class, which contains viscosity solutions 
of Hamilton-Jacobi equations and in which such control is still possible, is formed 
by semiconvex or semiconcave functions [IT] . We refer a reader interested in proofs 



of this and other convex analytic results used in this paper to monographs 17 18 . 
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2.2. Admissible velocities and admissible momenta. This section will de- 
scribe a procedure that gives a unique possible velocity and momentum at each 
point (i, x). The construction is based solely on the convexity of Hamiltonian in 
the momentum variable. We thus set the stage for the next section where the 
connection with the viscous regularization is established. 

If (t, x) is a regular point, i.e., not a point of shock, then the velocity u(t, x) and 
the momentum p(t, x) are naturally defined. Suppose now that (t, x) is a point of 
shock formed by intersection of smooth branches (pi, i £ 3. For a particle starting 
from a shock point (t, x) its possible velocity v must correspond to one of the 
the "available" momenta, i.e., to a momentum p that belongs to the convex hull 
of the momenta pi — W(j>i{t,x), i £ 3, or equivalcntly to the p-projection of the 
superdifferential d(p(t,x). 

In fact more can be said. For an infinitesimal positive r, when the particle has 
already left its original location with velocity v, not all branches (pi will be relevant 
for the solution (p at a point (t+r, x + vt) as r J, 0, but only those that contribute to 
the linear approximation of the solution, i.e., the minimum in minj e ;j(— Hi +pi ■ v) 
(cf. ( |2.4[ ) with £ = vt). The branches not contributing to the minimum can be 
discarded. 

Denote the set of relevant indices 

(2.7) I(v) := {j £ 3: - Hj +p r v = mm(-H t + Pi ■ v)}; 

it is nonempty because the minimum is attained due to convexity of H(t, x, •). 

We can now postulate that any possible velocity v of a Lagrangian particle inside 
a shock satisfies the following condition. This postulate is justified by Lemma [2] 
in 

Admissibility condition. A velocity v* is said to be admissible at (t, x) if the 

corresponding momentum p* = V v L(t,x,v*) belongs to the convex hull of momenta 
Pi with i £ I{v*): 

(2.8) p* £ conv{ Pj : j £ I(v*)}. 

This value of momentum p* is also called admissible at (t,x). 

Observe that since the index set I(v) depends on v, this condition can be viewed 
as a kind of self-consistency requirement. 
Equivalently, one can write 

(2.9) v* £ W p H{t, x, conv{p 3 : j £ I(v*)}). 

Note that, in contrast with theory of generalized characteristics for Hamilton- Jacobi 



equations |17,, Definition 5.5.1], there is no convex hull taken in (2.9) after the 
(nonlinear) map V p H(t,x, ■) is apllied to the superdifferential of at (t,x), even 
though the resulting set in the velocity space is generally non-convex. 



It is known 20 Example 3.5] that with an extra convex hull operation men- 
tioned above, uniqueness may fail for the velocity of a generalized characteristic. 
In contrast, the above definition allows to fix the velocity v* uniquely: 

Theorem 1 (uniqueness of admissible velocity). Let <f> be a viscosity solution to the 
Cauchy problem Then at any (t,x) there exists a unique admissible 

velocity v* — v*(t,x), which is the unique point of the global minimum for the 
function 

(2.10) L(v) := L(t, x, v) - min(-iJj + Pi ■ v). 
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Proof. Recall that L(t, x, v) is a strictly convex function of v because of assumptions 
formulated in <|T] Rewriting 

(2.11) Li(v) := L(t,x,v) + Hi — pi ■ v, L(v) — max Li(v), 

we see that L(y) is a pointwise maximum of strictly convex functions and therefore 
is strictly convex itself. Furthermore, because the Hamiltonian H(t, x,p) is assumed 
to be finite for all p, its conjugate Lagrangian L(t, x, v) grows faster than any linear 
function as |u| increases, and thus all its level sets are bounded. Therefore L(v) 
attains its minimum at a unique value of velocity v* . 

To simplify the presentation of ideas we start with an (elementary) proof of the 
theorem in the particular case when I(v*) is finite. We show first that the point of 



minimum v* satisfies the admissibility condition (2.8 1. Indeed, 

(2.12) V„£i(v*) - V„L(t, x, v*) - Pi = p* - Pi . 

Suppose that p* does not belong to the convex hull of pj, j € I(v*). Then there 
exists a vector h such that (p* — pj) ■ h < for all j € I(v*). It follows that 
Lj(v* + eh) < Lj(v*) for all j G I(v*) if e > is sufficiently small. Hence, L(v* + 
eh) < L(v*) for sufficiently small e, which contradicts to our assumption that v* is 
a point of minimum. This contradiction proves that v* is admissible. 

To prove uniqueness we show that if v is admissible then it is a (necessarily 
unique) point of global minimum for the strictly convex function L. Using the 
strict convexity of Lj, we obtain 

(2.13) Lj(v + h) = L(t, x,v + h) + Hj - pj ■ (v + h) 

(2.14) > Lj(v) + V v L(t,x,v) -h-pj-h = Lj(v) + (p-pj) ■ h, 

where p is the Legendre transform of v. Since v is admissible, p — Y] ^ ^jPji where 
all Xj > and J2j A i = 1 - Hence, £\ Xj(p - p 3 ) ■ h = [(^ Xj)p - Y, } X jPj ] ■ h = 
\p — p] ■ h = 0. It follows that (j) — Pj) ■ h > for at least one j £ I(v). This implies 
that L(v + h) > L{v), which means that v is a point of global minimum for L. 

In the general case of an arbitrary I(v*), only the first argument, namely admis- 
sibility of the global minimum v* , needs modification. We shall us the following 



result of Clarke based on earlier work of Ioffe and Levin 21 theorem 2.8.2 and 
corollary 1]). Let L(v) = maXj e ;j Li{v), where 3 is a compact topological space, 
and suppose that all functions Li(-) are convex and Lipschitz with the same constant 
and I(v) is the set of i's for which the maximum is attained; then the subdiffer- 
ential dL(v) is a weakly-* closed convex hull of the union of dLi(v) for i G I(v). 
(To justufy compactness of 3, observe that one can use the values of momenta pi 
instead of the abstract indices i and that the set of minimizers coming to (i, x) is 
closed and their momenta are bounded.) Now take into account that in our case 
dLi(v) — V v L(t,x,v) — pi and that v* is the point of minimum, i.e., that 

G dL(v*) = com{V v L(t,x,v*)-pi-. i e I(v*)} = {p* -pv.ie I(v*)}; 



this coincides with the admissibility condition p* € convjpi : i g I(v*) } (2.8). □ 



Thus the admissibility property, first formulated above in the hardly manageable 



combinatorial form (2.8), turns out to be the optimality condition for a nice convex 



minimization problem (2.10). In particular, if <j) is differentiable at (t, x), then 



L(v) — L(t,x,v) + H(t,x,V(j)) — V<p ■ v and the minimum in (2.10) is achieved at 
the Legendre transform of V^. We thus recover Hamilton's equation (1.9). 

The following reformulation will clarify the connection between admissibility and 
the original construction for the Burgers equation proposed by Bogaevsky in |13l|14| . 



Let Vi = V p H(t,x,pi) be the velocity corresponding to the limit momentum pi 
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and observe that pi — W v L{t,x,Vi). The Legendre duality implies that Hi 



H(t,x,pi) = pi ■ Vi — L(t,x,Vi) and therefore (2.11) assumes the form 
(2.15) 

L(v) — max [L(t, x, v) — L(t, x, Uj) — V. u L(t, x, Uj) • (v — v^] — max D£ x (v \ Vi). 

The quantity in square brackets is known as the Bregman divergence D£ x (v \ Vj) of 
vector v with respect to a non-symmetric measure of separation of vectors with 
respect to the convex function L(t,x,-) [22'. Theorem [I] terefore means that the 
admissible velocity is the center of the the smallest "Bregman sphere" containing 
all Vi, i € 3. When L(t,x,v) = \v\ 2 /2, the Bregman divergence reduces to (half) 
the squared distance between the two vectors. Therefore the admissible velocity v* 
exactly conicides with the centre of smallest ball containing all w i; and we recover 



the result of 13 14 



Finally, let us discuss the "physical" meaning of the function L. Consider an 
infinitesimal movement from (i, x) with velocity v. It follows from the least action 
principle that <j>(t, x) + L(t, x, v) dt — 4>{t + dt, x + v dt) > 0. It is easy to see that 
to the linear order in dt 

(2.16) <j>(t, x) + L(t, x, v) dt - 4>{t + dt,x + v dt) = L(v) dt. 

Hence the unique admissible velocity v* minimizes the rate of growth of the differ- 
ence in action between the true minimizers and trajectories of particles on shocks. 
In other words, the trajectory inside a shock cannot be a minimizer but it does its 
best to keep its surplus action growing as slowly as possible. 

3. The vanishing viscosity limit for velocities 

In the preceding section we constructed a canonical vector field of admissible 
velocities v* (t, a;) = V ' p H(t, x, p* (t, x)) that corresponds to a given viscosity solution 
cf> of the Cauchy problem ( |1.1[ ), ( |1.11[ ). Notice that in general this vector field is 
discontinuous on the shock manifold. 

To see how the vector field of admissible velocities arises for Lagrangian particles 
inside shocks, consider the vanishing viscosity limit for a flow corresponding to the 
parabolic regularization 

(3.i) ^ + H(t, x ,vr) = ^ 2 r, m>o, 



of the Hamilton- Jacobi equation (1.1 1. 



For sufficiently smooth initial data 4>o(y) = ^(t = 0, y) the partial differential 



equation (3.1) has a globally defined strong solution M , which is locally Lipschitz 
with a constant independent of fj,. Moreover, ty* converges as fx I to the unique 
viscosity solution <j> corresponding to the same initial data. Proofs of these facts 
may be found, e.g., in [I], where they are established for tfio € C 2 ' a . 
Consider now the transport equation 

(3.2) r(t) = v P H(t, y , v^(t, 7 M )), y (0) = V- 

For /i > this equation has a unique solution which continuously depends on the 
initial location y. Fix a point (to,Xo) with to > and pick trajectories 7 M for all 
sufficiently small /i > such that 7 M (to) - > £o as fi I 0. The uniform Lipschitz 
property of solutions y implies that the curves 7 M are uniformly bounded and 
equicontinuous on some interval containing t . Hence there exists a curve 7 and 
a sequence 1 such that lim^.^o 7 Mi = 1 uniformly in t on that interval. Note 
that all 7 Mi and 7 are also Lipschitz with a constant independent of /1 and that 
7(i ) = x . 
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Let furthermore v be a limit point of the "forward velocity" of the curve 7 at 
(to,Xo), i.e., let for some sequence r^, | 

(3.3) v = lim — [j(t +Tk) -7(*o)]. 

TklO Tfc 

We cannot conclude a priori that the curve 7 or the velocity v are uniquely defined. 
However it turns out that v must satisfy the admissibility condition with respect 
to the solution cj) and therefore it coincides with the unique admissible velocity v*. 

Theorem 2 (admissibility of limit velocities). The momentum p — \7 v L(to, xq, v) 
corresponding to a limit velocity v in (3.3 I is admissible at (to,Xo), i.e., it satisfies 
p G convjpi : i € /(£>)}• 

Proof. Our general strategy in what follows is a proof by contradiction: assume 
that v is not admissible and show that it then cannot be a limit velocity. 

We first set up some notation regarding geometry of the closed convex set 
d(/)(to,xo). Denote by (s,p) the space-time co-tangent coordinates, with s a scalar 
dual to the time subspace and p a vector dual to the d-dimensional configuration 
subspace. Let A c be the hyperplane supporting the convex compact set d(/)(to,xo) 
from below with the slope corresponding to the velocity v: 

(3.4) A c = {(s,p) : s — —p ■ v + min(pi • v - H,)} 

and let S be the intersection of A and of the superdifferential d<fi(to,xo), i.e., 
the face of d4>(to,xo) spanned by vertices (—Hi,pi) with indices in I(v) [cf. equa- 
tion ( 2.7 1] . In this geometric setting the admissibility condition for momentum ( 2.8 ) 
can be formulated in the following way: an admissible momentum must belong to 
the p-projection of the set S. Denote also $ := mmi^(pi ■ v — Hi). 

Lemma 3. If p does not belong to the p-projection ofS, then 

(3.5) M :— min [s +p ■ v - $ + - p) ■ {V p H(t , x ,p) - v)] > 0, 

(s,p)ed4>(t ,x ) 



Proof. It should be noted that M in (3.5| is an auxiliary quantity, which plays 
role in the subsequent proof but has no geometric meaning by itself. It can be 
seen as a sum of two parts, each of which is nonnegative for reasons related to 
the convexity of the Hamiltonian H and the superdifferential d<p{tQ, x ), and which 
cannot simultaneously vanish. 

Denote s — — p-v+$ and observe that the point (s,p) cannot belong to d(j){to, xq) 
because (s,p) € A c but the p-projection of the face S = A c n d(f>(t ,xo) does not 
contain p. 

Monotonicity of the gradient V p H(t ,xo,p) of the convex function H implies 
that for p 7^ p 

(3.6) ( P -p)-(V p H(t ,x o ,p)-v)>0. 

Indeed, from the strict convexity of H(to, xq, •) in momentum it follows that 
H{t Q ,x a ,p) > H(t ,x ,p) + (p-p) ■ W p H(t ,x ,p), 
H(t a ,x ,p) > H(t Q ,x ,p) + {p-p) ■ W p H(t ,x ,p) 

whenever p ^ p and in particular when (s,p) € S. Adding these two inequalities 
and taking into account that V p H(to, xq,P) — v, we get (3.6). 

Furthermore, as A c supports d<p(tQ,xo) from below, for all (s,p) £ d(f)(to,Xo) we 
have 

(3.7) s+p-v-$>0 
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with equality only when (s,p) G A c . Thus the function of (s,p) in the square 



brackets in (3.5) is strictly positive on dcj>(to,xo). Indeed, if (s,p) belongs to the 



face S, then (3.6) is positive, and otherwise (13. Tb is positive. □ 



In the rest of the proof it will be convenient to use a different rearrangment of 



the expression in square brackets in (3.5) 



(3.8) [I] - [II] := [s + (p - p) • V P H (t , x 0) p)} - [$ - p • v]. 

Next we provide a precise meaning to the intuitive idea that for (t, x) sufficiently 
close to (to,Xo) and /i sufficiently small, the values of the function (f>^ and its 
derivatives are close to those for the linearization 

(3.9) 4>(t , x ) + win \pi ■ (x - x ) -(t- to) H t } 

of the viscosity solution (f> near (to, Xo). 

For e > let V e be the e-neighbourhood of d(j)(t ,x ). Choose e < M/(6 + 3|u|) 
so small that (s,p) ^ V e and 

(3.10) min ([I] - [II]) > 2M/3 > 0. 

(s,p)ev c 

Using the upper semicontinuity of the superdifferential (see e.g. [Tt[ Proposition 
3.3.4] or [18 | Corollary 24.5.1], where a similar result is proved for convex functions), 
choose R = R(e) > and T — T(e) > such that for all (t,x) G T> t ,r := 
{(t, x) : < t — t < T, \x — x \ < R} the superdifferential d<fi(t, x) is contained in 
the set V € /2- 

Reducing T, R if necessary and using the Lipschitz property of <f>, ^ (which 
implies boundedness of momenta) and continuity of V p H(t, x,p) in (t, x) variables, 
we can assume in addition that for all (t, x) G 23t,j? 

(3.11) |(p-p)- V p H(t,x,p)-(p-p)- V p H{t ,x ,p)\ <e 

Denote T(t) := xq + v(t — to). Reducing T once again, we can guarantee that 
for all t < t < t + T both |r(f) - T(t )\ < qR and \j(t) - j(t )\ < qR with 
any < q < 1 (this margin is needed because we will approximate 7 by 7^, which 
must also belong to Dt,r) and that, moreover, d(j)(t,F(t)) is contained in S e /2, the 
e/2-neighbourhood of S. The latter is possible because all limit points of dcf>(t, T(t)) 
as t J, to belong to the face of d<fi(to,xo) that corresponds to the direction v, i.e., 
to S. A proof of this result, which refines the upper semicontinutity property 
of superdiffcrcntials mentioned above, can be found e.g., in the context of convex 
functions in (l8j Theorem 24.6]; its generalization to the semiconcave case is evident. 

In what follows we will refer to the values of fa from the sequence that de- 
termines 7, but will drop the index i to simplfy the notation. Choose /2 = /2(e) 
sufficiently small so that the following three conditions hold: 

(t, 7(t)) G T)t,r for/i</x(e) 

(this is indeed possible because (t,"f(t)) £ ^>T, q R with q < 1), 

(3.12) (^(i,x),vf(i,x))ey ( 

everywhere in T>t,r, and 

(3.13) (^(t,r(i)),v^(t,r(t))) G5 e 

for to < t < to + T. The latter two conditions hold because convergence of semi- 
concave functions ^ to <f> implies that limit points of their derivatives belong to 
d<f>(t,x) C V e /2 (in particular, d4>(t,T(t)) C S e / 2 along the trajectory T). 
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We are now set for the concluding argument. Assume that v is not an admissible 
velocity and therefore the correspondent momentum p does not belong to the p- 
projection of S. We are going to show that in this case, although trajectories 7^ 
may occasionally pass close to the trajectory T(t) — x + v(t — t ), any possible 
limiting value of velocity of the limit trajectory 7 as r = t — to ^0 differs from v 
by a positive constant. The central argument is provided by the following lemma. 

Lemma 4. Under conditions of Lemma^fix arbitrary positive r < T/3 and 8 < 
RI/[6(L + \p\)], where L is the common spatial Lipschitz constant of (ft 1 in Dt,r 
for < fx < p. Define the cone Kg := {(t,x) G T>t.r'- \x — r(t)| < S(t — to)} and 
suppose that (to + r, 7 (to + r)) G Kg. Then (t,7 M (t)) ^ Kg for all /i < p andt such 
that 3r < t - t < T. 

Proof. The full time derivative of the function (t, x) h-> (f>^(t, x) — p ■ x along 7 M is 
given by 

(3.i4) ^(t,7"(t)) -P-rm = ^-(*,y) + (W(*,y) -p) ■ r 



dt 



(t, y) + (V<^(t, r) - P) ■ V P H(t, 7 M , W(f, 7 M )) 



> -j^(t, y) + (V^(t, y) - P) • V p ff (t , 10, W(f, 7 M )) - e, 

where the last inequality follows from ( 3.11[ ) . Integrating this from to + r to t we 
get 

[I] ^ (t , y (t) ) - p ■ r (t) - r (to +r,r(to+r))+p-r (*o + t) 



> 



t +T 



dt 



it', y) + (V^(f, 7 M ) - p) ■ V p ff (t , x , V<^(t', y)) 



e(f - t - r) 



On the other hand 
d 



dtp 



(3.15) f t {r (t, r(t)) - p • r(t)] = ^(t, r(t)) + W(t, r(t)) -p)-v 



<$-p-v + e(l + \v\), 



where we took into account (3.13) and the fact that s + p ■ v — $ for all (s,p) G S 
(cf. J3?7|). It follows that 

[II] <^(t, r(t)) - p • T(t) - <^(t + r, r(t + r)) + p • r(t + r) 



< 



($ -p • v) dt' + e (1 + \v\)(t -to- r). 



t +T 



Subtracting [II] from [I], using (3.8), (3.10), (3.12) and observing that the Lip- 
schitz property of M (and correspondingly that of x i— > ^(t^x) — p ■ x, with the 
constant L + \p\ ) implies that 

(3.16) |0"(to + r, r(t + r)) - p • T(t + t) 

-^(to + r,7^(t + T))+p- 7 ' J (to + r)| < (X + |p|)5r, 

we get 

(3.17) r(t, ^(t)) - p ■ r(t) - ^(t, r(t)) + p ■ v(t) 

> [|M - e(2 + \v\)](t - t - t) - (L + \P\)St. 
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Using again the Lipschitz property and the inequality e < M/(6 + 3|v|), we get 



§M - e(2 + ^ ± N r ^ M 



(3.18) | 7 M W-r(t)| > 3 L + w {t-t Q -r)-8r > (t-t Q -r)-Sr. 

Since 5 < M/[6(L+ \p\)], this means that 7^(t) stays outside K$ for t — to > 3t. □ 

We can now conclude the proof. Suppose that there is a sequence t% 1 to such that 
(ti,j(ti)) £ Kg. Then for all sufficiently small fi Lemma |4] implies that (t,7 M (t)) ^ 
Ks when t > to + 3(t; — to) for all i, which means in turn that (t,7(t)) also cannot 
belong to Kg for such t. As f, 4- the trajectory 7 has to stay outside Ks for 
all to < t < to + T, a contradiction with what has been assumed. This proves 
Theorem H □ 

4. Integral curves of the field of admissible velocities 
4.1. The issue of existence and uniqueness for integral curves. We have 



seen that limit trajectories 7 of solutions 7 M to the transport equation (3.2) arc 
tangent in forward time to the discontinuous field of admissible velocities v* , i.e., 
that j(t + 0) = v*(t,x) for any t and for any limit trajectory 7 passing through 
some x = j(t). This however does not imply that limit trajectories are unique. 

There are in fact two different uniqueness problems: that for limit trajectories 
as n — > for the viscous regularization (|3.2|), repeated here for convenience: 



(4.1) r(t) = v P H(t,^,vr(t,in), 

and that for integral curves of the differential equation 

(4.2) 7(i + 0) =«*(*, 7). 



Since any limit trajectory of (4.1) is an integral curve of (4.2) according to Theo- 
rem [2j uniqueness for integral curves would imply uniqueness for limit trajectories. 
However, it is a priori possbilc that more than one integral curve passes through 
the same singular point, but the vanishing viscosity regularization selects only one 
among these curves as a limit trajectory. 

Uniqueness of limit trajectories can be established in the case when the Hamil- 
tonian is quadratic in the momentum variable. This follows from a particular 
differential inequality for the squared separation between two close trajectories, 



first established in 16 , which allows to control the expansion of the distance in 
terms of the semiconcavity constant of the solution 4>. Indeed, two limit trajec- 
tories passing through the same point (t, x) cannot diverge by a finite distance in 
finite time, because their viscous regularizations must stay arbitrarily close to one 
another over this time interval, provided these regularizations are close enough at 



time t. Hence, as observed in 13 14 , the limit flow 7 is defined uniquely and is 
therefore coalescing: once two trajectories intersect, they stay together at all later 
times. 

However an analogue of the key differential inequality is not known for arbitrary 
convex Hamiltonians. Observe also that the argument just outlined bypasses the 
issue of integral curves altogether. It is therefore interesting to consider the exis- 



tence and uniqueness issues for the differential equation (4.2 ) irrespective of viscous 
regularization. Here we will restrict ourselves to formal arguments based on rather 
generous regularity assumptions and aiming to convey the intuition of what is going 
on. 

Let the shock manifold of 4> be locally finite, i.e., suppose that at each shock 
point (to,xo) there is a finite number k of minimizers connecting that point with 
the initial data. This implies that in a neighbourhood of (to, xq) the solution 4> may 
be represented locally as a pointwise minimum of a finite number k of C 2 smooth 
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branches 4>i , each of which satisfies the Hamilton-Jacobi equation classically, and 
that locally the shock manifold is composed of C 1 smooth pieces of different di- 
mensions. (The case of preshocks, introduced on p. [7j provides an exception to the 
condition of smoothness and should be considered separately.) 

One can show that on each smooth piece of the shock manifold the spacetime 
field (l,u*) determined by admissible velocities is a Lipschitz vector field tangent 
to the piece. The usual ODE arguments then show that the flow generated by the 
vector field v* is uniquely defined on smooth pieces of the shock manifold, as well 
as in the bulk where the solution 6 is smooth. 



In Section 2.2 it was shown that at a shock point (to,xo) not all of the inter- 
secting branches <j>i of solution are relevant for the integral curve 7 at times t > to, 
but only those with i 6 I{v*), i.e., those that are relevant in the first-order (lin- 
ear) approximation to both the solution </> and the integral curve 7. Denote the 
corresponding index set with 3\ := I(v*). 

Uniqueness of integral curves can only fail at shock connections: there must be 
at least two pieces of shock manifold that have a common point (to,xo) and share 
the same tangent spacetime direction (l,v*) but at later times carry two disjoint 
trajectories both issued from xo at time to with velocity v* . Note that this is not 
possible if \3±\ < d+ 1, where d is the spatial dimension, and the velocities Vi are 
in general position: indeed, in this situation removal of any branch <j>i with i E 3i 
would change the admissible velocity v* . 

In fact a (formal) perturbative analysis of an integral curve 7 in higher orders of 
approximation reveals a nested sequence of finite index sets 3\ D 3% D . . . such that 
3k lists branches relevant for the integral curve in fcth order, and the intersection 
3 = C\k>i3k is not empty (i.e., the sequence stabilizes). In particular if |3| < d+ 1, 
then the integral curve 7 is defined uniquely. 

In what follows we illustrate this procedure in the second order and obtain 32- 

4.2. Admissibility in the second order of perturbation theory. Take an 
integral curve 7 such that 7 (to) = ^0 an d assume it to be twice differentiable at to'. 

(4.3) 7 (t) =x + (t- foK + + o((t- to) 2 ) , 

where v* is the vector of admissible velocity at {to,Xo) and a is the yet unknown 
acceleration of 7 at t - At times t = t + t with sufficiently small r > the point 
j(t) lies at intersection of a possibly smaller set of branches <f>i, which all have the 
same value at (t, 7(i)). The first two time derivatives of this common value along 
7 can be expressed as follows. 



Using the Hamilton-Jacobi equation (1.1 1 and denoting pj (i) = V4>i(t,j(t)), for 
the first time derivative we get 

(4.4) Mt, 7(*)) = 9tMt, 7(*)) +7(*) ' V&(t, 7 (t)) = 7(*) -P?(*) " 7(*),P?(*)). 
In particular 

(4.5) 4>i(to,x ) = v* -pi - Hi, 

where pi = p]{to) and Hi = H(to, Xo,pi) as above. Using the Legendre duality (see 



(1.6) and discussion thereafter), we can modify expression (4.4) as follows: 



(4.6) <Mi, 7 (t)) = 7(i) • P ](t) - H(t, 7 (*),p7(*)) 

- (7(t) ~ v7(t)) ■ W v L(t, 7 (t),«7(t)) + L(t, 7 (*W(t)), 

where vj{t) = V p H{t, 7, V&(t, -y(t))) &ndp](t) = V v L(t,j(t),v7(t)) = V^(i, 7 (t)) 
are values of velocity and momentum that correspond to the gradient p] (t) along 
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the curve 7. Recalling the expression for Bregman divergence (2.15) 



(4.7) 



D£ (v* I v) — L(t, x,v*) — L(t, x, v) — (v* — v) ■ V v L(t, x, v), 



we can now express the time derivative fa(t,j(t)) in the form 

(4.8) <Mi,7«) = L(t,7(*),7(*)) ~ D? (t) m) \ vj{t)). 

Observe that the difference between fa(t,j(t)) and the mechanical action along 
the curve decreases as the (negative) integral over (to,t) of the Bregman diver- 
gence D£ (7 I vj). Of course subtracting the common quantity from the values 
of branches fait, j(t)) for all i does not change the mutual order of these values. We 
notice that the bigger is the Bregman divergence D'£ J ^ '^(7 | vj), the faster decreases 
this difference: up to the second order in t — t , the value min; fa will be attained 
at the branch or branches for which 7(f) is the most distant (in the Bregman sense) 
from vjit). 



To obtain the second time derivative we differentiate the r.h.s. of (4.4) to get 

«?(*)• p7(*) 

H(t, 1 (t),p](t))+j(t)-W x H(t, 1 (t),p](t)) 



(4.9) 4>i(tMt)) = m-pUt) + i(t)-pJ(t) 



dt 

It is convenient again to consider the second time derivative not of 1 
the difference between fa and the mechanical action of 7: 



itself, but of 



(4.10) fa(t, 1 (t))--L(t, 1 (t),j(t)) 



7(*) -(p7(*)-pZ(*)) + (7(*) -«?(*))• P?W 



H(t, 7(*),P7(*)) + 7(t) • V x H(t, l(t),p](t)) 



L(t, 7(*),7(*)) + 7(*) ' VxHt, j(t),j(t)) 



where pl(t) = J \/ v L(t,j(t), A f(t)) is the value of momentum corresponding to the 
velocity jit). In particular at time to we have 

(4.11) fa - ^ = a ■ (pi -p*) + (v* - Vi ) ■ fi - ([H]i + [L]i), 



p*(io) is the usual admissible momentum (cf. (2.8)), 



v7(to) 



where p 

fi = p](to), and [H]i, [L]i denote values of the two square brackets at t = t . 

Consider now an integral curve 7 that is determined by intersection of smooth 
branches fa for some i € 3. Two conditions must hold for small t — to > along 
this curve: 

(i) the velocity j(t) must be admissible at (t, j(t)), i.e., be the center of the 
"Bregman sphere" containing all vj{t) at its boundary; 

(ii) values of the remaining branches at (t, j(t)) must be greater than the com- 
mon value of fa(t,-y(t)). 

Define the piecewise linear concave function 

(4.12) F(a) = min (a ■ { Pl - p* ) + («*- + [£]*))■ 

Note that the velocity v* of the curve 7 at time to is known, and therefore the 
values of /, are the same for any integral curve 7, so the function F can be defined 
without knowing the curve 7. It is easy to see that the set 3(a) of indices where 
minimum is attained in (4.12) consists of precisely those indices for which condition 
(ii) holds. This set plays the same role in the quadratic approximation as did the 
set I*(v) in the linear approximation. 
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Condition (i) then becomes an admissibility condition for the acceleration similar 
to (2.8). Geometrically, the admissible acceleration a is the value at time to of the 
rate of change of the center of the smallest Bregman sphere containing all v i (t) for 
sufficiently small t — to; compare this description with the fact that the velocity 
-y(t) is given by this center itself. It is clear that depending on the rates it (or 
equivalently, the values Pi — fi) at time to j some of the velocities present at to may 
"sink" into the interior of the Bregman sphere for small r = t — to > 0, leaving its 
surface defined by a smaller set {vi : i € 3 2}- 

In a similar way one can define the index sets J3, 3<4, and so on. Notice that this 
decreasing sequence of index sets will stabilize, since their intersection is nonempty. 
We conjecture that the resulting set -3 = n s >i3s determines the smooth manifold to 
which the integral curve 7 belongs and which determines it uniquely as the integral 
curve of the corresponding filed of admissible velocities. 
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